庖丁解牛分词器是基于lucene的中文分词系统的软件。庖丁解牛简单便捷的文件分割合并工具。庖丁解牛拥有直观清晰的中文界面,允许用户将指定的文件按照自定义大小进行切割,并可以将切割后的子文件合并还原成源文件。庖丁解牛这款软件可以满足用户基本的文件分割合并需求。
科学软件资源导航
Scientific software resource navigation
标签: #Natural language processing
Weka is tried and tested open source machine learning software that can be accessed through a graphical user interface, standard terminal applications, or a Java API. It is widely used for teaching, r
Tregex?is a utility for matching patterns in trees, based on tree relationships and regular expression matches on nodes (the name is short for "tree regular expressions"). Tregex comes with?Tsurgeon,
The Stanford Topic Modeling Toolbox (TMT) brings topic modeling tools to social scientists and others who wish to perform analysis on datasets that have a substantial textual component.
A natural language parser is a program that works out the grammatical structure of sentences, for instance, which groups of words go together (as "phrases") and which words are the subject or object o
Rwordseg: Chinese Word Segmentation
The Natural Language Toolkit (NLTK) is a Python package for natural language processing. NLTK requires Python 3.5, 3.6, 3.7, or 3.8.
主要功能包括中文分词;英文分词;词性标注;命名实体识别;新词识别;关键词提取;支持用户专业词典与微博分析。NLPIR系统支持多种编码、多种操作系统、多种开发语言与平台;定位为在微博为代表的新型互联网的大背景下,面向海量异构互联网信息,研究网络大数据搜索、自然语言处理、社会计算与信息安全等关键技术,以自然语言理解为主要手段进行网络情报挖掘,并进行新应用协议的安全隐患分析。
NiuTrans.SMT is an open source statistical machine translation system, jointly developed by the Natural Language Processing Laboratory of Northeastern University in China and Shenyang Yayi Network Tec
东北大学自然语言处理实验室提供的分词软件